Evolution of meta-parameters in reinforcement learning algorithm

نویسندگان

  • Anders Eriksson
  • Genci Capi
  • Kenji Doya
چکیده

In most Reinforcment Learning approches, the metaparameters such as learning rate and ”temperatur” for exploration are adjusted manually. In order to build fully autonomous learning agents, it is important to develop methods for adjusting these parameters to match the demands of the task and the environment. In this paper, we propose a new method to determine the values of meta parameters in reinforcement learning, based on evolutionary approach. Simulations and experimental results with the Cyber Rodent robot show that meta parameters have a strong effect on agent performance and they are strongly related with each-other.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Evolution of Meta-parameters in Reinforcement Learning

A crucial issue in reinforcement learning applications is how to set meta-parameters, such as the learning rate and ”temperature” for exploration, to match the demands of the task and the environment. In this thesis, a method to adjust meta-parameters of reinforcement learning by using a real-number genetic algorithm is proposed. Simulations of foraging tasks show that appropriate settings of m...

متن کامل

Neural Networks letter Meta-learning in Reinforcement Learning

Meta-parameters in reinforcement learning should be tuned to the environmental dynamics and the animal performance. Here, we propose a biologically plausible meta-reinforcement learning algorithm for tuning these meta-parameters in a dynamic, adaptive manner. We tested our algorithm in both a simulation of a Markov decision task and in a non-linear control task. Our results show that the algori...

متن کامل

Optimum Parameters for Tuned Mass Damper Using Shuffled Complex Evolution (SCE) Algorithm

This study is investigated the optimum parameters for a tuned mass damper (TMD) under the seismic excitation. Shuffled complex evolution (SCE) is a meta-heuristic optimization method which is used to find the optimum damping and tuning frequency ratio for a TMD. The efficiency of the TMD is evaluated by decreasing the structural displacement dynamic magnification factor (DDMF) and acceleration ...

متن کامل

Embodied Evolution of Learning Ability

Embodied evolution is a methodology for evolutionary robotics that mimics the distributed, asynchronous, and autonomous properties of biological evolution. The evaluation, selection, and reproduction are carried out by cooperation and competition of the robots, without any need for human intervention. An embodied evolution framework is therefore well suited to study the adaptive learning mechan...

متن کامل

Online Meta-learning by Parallel Algorithm Competition

The efficiency of reinforcement learning algorithms depends critically on a few metaparameters that modulates the learning updates and the trade-off between exploration and exploitation. The adaptation of the meta-parameters is an open question in reinforcement learning, which arguably has become more of an issue recently with the success of deep reinforcement learning in high-dimensional state...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003